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Breast cancer is the second leading cause of mortality among female cancer 
patients worldwide. Early detection of breast cancer is considerd as one of 
the most effective ways to prevent the disease from spreading and enable 
human can make correct decision on the next process. Automatic diagnostic 
methods were frequently used to conduct breast cancer diagnoses in order to 
increase the accuracy and speed of detection. The fuzzy-ID3 algorithm with 
association function implementation (FID3-AF) is proposed as a 
classification technique for breast cancer detection. The FID3-AF algorithm 
is a hybridisation of the fuzzy system, the iterative dichotomizer 3 (ID3) 
algorithm, and the association function. The fuzzy-neural dynamic- 
bottleneck-detection (FUZZYDBD) is considered as an automatic fuzzy 
database definition method, would aid in the development of the fuzzy 
database for the data fuzzification process in FID3-AF. The FID3-AF 
overcame ID3’s issue of being unable to handle continuous data. The 
association function is implemented to minimise overfitting and enhance 
generalisation ability. The results indicated that FID3-AF is robust in breast 
cancer classification. A thorough comparison of FID3-AF to numerous 
existing methods was conducted to validate the proposed method’s 
competency. This study established that the FID3-AF performed well and 
outperform other methods in breast cancer classification. 
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1. INTRODUCTION 


Breast cancer is one of the most lethal types of cancer for women worldwide, ranking as the second- 
largest cause of death for female cancer patients [1], [2]. Over 250,000 women are diagnosed with invasive 
breast cancer each year in the United States [3]. Breast cancer is primarily caused by family history, as patients 
inherit genetic abnormalities in their genes [4]. The disease has a rapid metastasis process, enabling the primary 
tumour to rapidly spread breast cancer cells to other organs [5]. Breast cancer’s metastatic features are a 
significant factor in its high incurability rate [6], [7]. Breast cancer therapy advances have resulted in a decrease 
in breast cancer mortality rates across all age groups [8]-[10]. Younger women, on the other hand, had a poorer 
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survival rate and a greater risk of local recurrence [1]. Apart from establishing prophylactic methods, early 
detection of breast cancer is critical to halt the disease’s rapid progression [6]. 

While a clinician can identify breast cancer manually, it would take an extended length of time and 
much effort to conduct the diagnosis or classification for breast cancer [11]-[16]. Thus, early diagnosis of breast 
cancer using an intelligent system, such as automatic diagnostic methods, is critical in the medical field 
[17]-[22]. Classification can be accomplished using well-known machine learning techniques such as deep 
neural networks, support vector machines, k-nearest neighbour (KNN), random forest, and decision trees [11], 
[14], [18]-[43]. This article presents the fuzzy-ID3 algorithm with an association function (FID3-AF) to 
increase the efficiency of breast cancer classification. The advantages of this strategy include the decision tree’s 
excellent intelligibility, interpretability, and ability to deal with erroneous and uncertain data in a fuzzy 
representation [13], [14], [16], [44]-[48]. However, with FID3-AF, which does data fuzzification, it is required 
to define the fuzzy database. 

The automatic definition of fuzzy databases using formal methods such as genetic algorithms and 
fuzzy clustering algorithms are computationally costly and do not always produce the best results because there 
are no general rules for selecting the most appropriate method for each specific domain [44], [49]-[54]. Thus, 
this article implemented the fuzzy-neural dynamic-bottleneck-detection (FUZZYDBD), an automatic fuzzy 
database definition method proposed by Cintra [44], to create the fuzzy database in FID3-AF. The FID3-AF 
approach was proposed to create a classification system that is both rapid and effective. FID3-AF fuzzified data 
and substituted numerical values with linguistic variables from fuzzy sets with the highest degree of 
compatibility. Thus, it overcomes the classic ID3 algorithm’s problem of being unable to classify continuous 
data. The association function is implemented in the decision tree to address the overfitting and poor 
generalisation issues associated with the standard ID3 [55], [56]. Testing and verification processes are used to 
validate the method’s performance. The remainder of this work is structured as: the materials and methods, the 
results, the discussion, and the conclusion. 


2. METHOD 
2.1. Fuzzy system 

Zadeh [57] proposed the fuzzy logic in 1965. It is a precise imprecision logic based on the degree of 
truth. The fuzzy system, which is based on fuzzy logic, operates through the use of logical variables with 
values ranging from 0 to 1. It is typically used to address data imprecision issues through the use of fuzzy set 
theory. The fuzzy system is composed of three key processes: fuzzification, inference engine, and 
defuzzification. Fuzzy systems need granulation of the domain’s features, which are represented as fuzzy sets 
and partitions, in order to do fuzzification using a fuzzifier [58]. Fuzzy sets and partitions would be used to 
create the membership functions. Each fuzzy set is individually described by a single uniquely stated 
membership function [59]. As a result, the specific membership functions are usually identified by the labels 
of the associated fuzzy sets. For each of the fuzzy sets, each input factor is allocated a range of values [60]. 
Membership functions are used to record the degree of membership in fuzzy sets and to grade their 
membership. 

The rule-based fuzzy system consists of a knowledge base and an inference engine [61]. The 
knowledge base was divided into two parts: a fuzzy rule base (FRB) holding a set of fuzzy rules for the 
dataset and a fuzzy database containing the definitions of the fuzzy sets, including the linguistic variables 
used in the FRB [62]. Meanwhile, the inference process generated the system’s outputs via fuzzy reasoning. 
The inference engine, which also can be referred to as the inference mechanism, would use fuzzy rules to 
translate the input to the output. The inference engine generates the most relevant consequents for each rule. 
When building a fuzzy inference system (FIS), the connection between fuzzy parameters such as rules and 
membership functions is essential [63]. The following is how the fuzzy rule is expressed: 


IF (V1 i IS Ali) AND (V2 iIS A2 i) AND...(Vj i IS Aj i) THEN (CLASS: IS Ci) 


The linguistic variable V has been used to represent the linguistic variable of the attribute, whereas 
A is the linguistic value of the data. Following that, j denotes the number of data attributes, whereas i denotes 
the number of rules. The primary distinguishing characteristics of a fuzzy system are incorporating linguistic 
variables, the interdependence of variables via conditional rules, and applying the fuzzy technique to validate 
complicated relationships [64]. Incorporating linguistic variables implies that the variables can be interpreted 
in natural language using the fuzzy technique. The interdependence of variables via conditional rules 
indicates that linguistic variables in the antecedent portion of fuzzy rules express as characteristics, but those 
in the consequent part express as classes. Validation of complicated dependency via fuzzy approaches allows 
for the justification of the link between a class and a linguistic variable via fuzzy logic. Finally, 
defuzzification is used when precise values are necessary, and the resulting values are aggregated to create 
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precise output. Defuzzification is the process of converting a fuzzy output to a crisp value by utilising the 
fuzzy set and membership degree [60], [65]. The most well-known defuzzification methods are the centre of 
gravity (COG), mean of maximum (MOM), and centre average [66]. The fuzzy system is a popular method 
in machine learning since it is easier than other methods and can achieve high accuracy rates simultaneously 
[67]. 

According to [68], the adoption of the fuzzy approach is effective at detecting breast cancer, with an 
accuracy of greater than 98%. Furthermore, according to [69], when used to a histopathology image dataset, 
the fuzzy model attained an accuracy of 94.26%. Meanwhile, Schaefer et al. [70] shown that when applied a 
fuzzy technique, the experimental findings of breast cancer thermogram classification reached an accuracy of 
80%. The superior accuracy results of prior researches have demonstrated the efficiency of the fuzzy 
technique in classifying data. 


2.2. Decision tree: ID3 algorithm 

In machine learning, a decision tree (DT) is a well-known classification method [71]-[76]. 
Numerous advantages exist for using DT in classification, including superior interpretability, scalability, and 
the ability to express in both graphical and textual formats [19], [77]-[79]. The three most well-known 
decision tree learning algorithms are ID3, C4.5, and CART. ID3 is chosen to examine decision tree learning 
in this research since it is the most often used learning algorithm [80]—[82]. In 1986, Quinlan created the 
iterative dichotomizer 3 (ID3) [83], [84]. The ID3 approach is based on recursive partitioning, in which the 
training data is divided into subsets, and the subsets produce the decision tree’s partitions [77], [85]. The ID3 
algorithm selects attributes based on Shannon’s entropy and information gain [85]. The information entropy 
is determined using (1), where S represents sets that are classified into multiple classes and p; denotes the 
proportion of data that is classified into each class. Equation (2) illustrates the computation of information 
gain, where A represents the current attribute, S indicates sets that are separated into various classes, and n 
denotes the potential values for the corresponding attribute. Sj signifies the subsets that have the same credits 
attribute A. 


k 
Entr(S) = —) pilogs( pi) (1) 
i=1 


n 
Is 


Gain(A,S) = Entr(S) — 5] 
j=1 


Entr (S;) (2) 


The ID3 algorithm prioritises the largest information gain in order to determine the most acceptable 
qualities in each tree branching. The branching procedure is repeated repeatedly using a greedy top-down 
technique until the decision tree satisfies the termination conditions, such as all attributes in the datasets are 
fully utilised or all balance instances have the same class. Classification using ID3 is only possible if the 
dataset in question contains more than one class [83]. The ID3 algorithm creates rules for class prediction 
while also highlighting the properties or characteristics of the corresponding class [86]. The generated 
decision tree size is small with ID3 algorithm implementation due to the algorithm’s usage of quality 
measures and logical reasoning. 

Angayarkanni and Kamal [87] evaluated the ID3 algorithm’s performance and capability for breast 
cancer domain classification using an magnetic resonance imaging (MRI) mammogram image dataset. The 
aforementioned dataset has three distinct classes: benign, malignant, and normal. The average accuracy of the 
ID3 algorithm was 99.9%, and the training time of the algorithm only takes 0.03 seconds. It illustrates the 
algorithm’s ability to get correct classification results in a reasonable duration. According to Yang et al. [88], 
the algorithm achieved a prediction accuracy of 90.56% using the wisconsin breast cancer dataset. 
Additionally, Jacob and Ramani [89] demonstrated that when used with the wisconsin prognostic breast 
cancer dataset (WPBC), ID3 beat other classifiers in terms of classification accuracy, including naive Bayes 
and partial least squares for classification (C-PLS). There are several advantages to using the ID3 algorithm 
for DT learning, and the most significant is its short execution time [56], [90]. 


2.3. Fuzzy-neural dynamic-bottleneck-detection (FUZZYDBD) method 

The fuzzy automated definition technique is crucial for developing a fuzzy database [49]. The three 
components of developing a fuzzy database are the type of the membership functions, followed by the 
number and distribution of fuzzy sets associated with each domain characteristic [44]. Numerous techniques, 
such as the evolutionary algorithm, the artificial neural network, and the KNN, may be used to construct 
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fuzzy databases to estimate the number of fuzzy sets. Despite the fact that numerous methods exist, it is 
critical to emphasise that many studies define the number of fuzzy sets empirically and then distribute them 
evenly for the membership function due to the high complexity of the available methods and the adaptability 
of fuzzy logic to achieve improved performance [49]. Additionally, there is a dearth of consensus on the most 
effective existing techniques for certain applications and domains [91]. The study identified the FUZZYDBD 
technique, proposed in [49], as a quick, simple, and effective method for defining fuzzy databases [9]. 
Furthermore, it had been empirically evaluated in several other fields, including breast cancer. The 
FUZZYDBD method collects all essential features to develop a fuzzy database and overcomes the limitations 
of previous techniques for developing fuzzy databases [49], [91], [92]. According to Cintra [44], the 
FUZZYDBD method employs a variety of techniques, including the Wang Mendel method [93] for 
determining the total number of fuzzy sets for all attributes, the equalised universe method for partitioning of 
fuzzy sets across domains and the usage of triangular membership functions. 


Distribution of triangular fuzzy 


sets with Equalized Universe 
method (equal partitioning) 


Estimate number of fuzzy sets 
using Wang Mendel method Fuzzy 
(standardise number of fuzzy sets) Database 


Figure 1. Approaches of the FUZZYDBD method 


In the approach of the FUZZYDBD method, the number of fuzzy sets is uniform and consistent 
across all attributes in the dataset and can be ranging from two to ten triangular membership functions. The 
appropriate number of fuzzy sets between the ranges is determined empirically. Additionally, a domain 
expert can develop more suitable fuzzy set values for the attributes [44], [94]. The Wang-Mendel technique 
uses the same number of fuzzy sets, distribution, and shape of fuzzy sets to define all attributes in the 
obtained dataset [44]. Although the number of fuzzy sets can be defined in the range of two to ten, the 
FUZZYDBD method is more usually used with three, five, or seven [44], [95], [96]. Both two and three are 
the most conservative estimates of the number of fuzzy sets with the lowest error rate for the breast cancer 
dataset [49]. 

Chen and Wang devised the equalised universe method in 1999, and this technique was later 
incorporated by the FUZZYDBD method [44]. The approach is applied with the same width for each fuzzy 
set to ensure that the fuzzy sets in the domains’ attributes are partitioned equally. The attribute’s maximum 
value is located at the peak of the most right triangular membership function, while the attribute’s minimum 
value is located at the peak of the most left triangular membership function. When triangular shape 
membership functions with half overlap are applied, no region has a membership degree greater than 0.5. 
This method is a frequently used technique in the literature [44]. One of this technique’s primary advantages 
is that it generates extremely simple fuzzy databases. 


2.4. Association function 

The association function (AF), developed from the correlation function, emphasises the association 
between randomly chosen variables [86]. One of the benefits of AF implementation is that it overcomes the 
limitation of the conventional ID3 algorithm, which tends to choose attributes with a large number of 
attribute values and illustrates the link between the attributes and their corresponding class attributes 
concurrently [97]. The implementation begins with the computation of AF for each attribute in the dataset. 
For example, there are multiple attributes in dataset D, and let C be any of the dataset’s class attributes. Thus, 
(3) depicts the degree function of the relationship between S, which represents one of the attributes, and C. 
Equation (3) lists the representation, where n denotes the number of attribute types in S. Both x,,and 
Xj illustrate two distinct situations, with dataset D’s attribute S representing the ith value and category C 
representing the jth value (j=1,2) [82]. Then, the computation value of the normalised relation degree 
function, also known as the normalised gain, is performed. Equation (4) is the equation for normalised gain 
for each attribute, where m is the total number of attributes in the dataset. 
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AF(S) = ea Mis = Ml és 


AF (k) 
AF (1) + AF(2)+... +AF(m) (4) 


V(k) = 


Finally, the initial (old) information gain has been combined with a normalised gain, as shown in 
(5), where Gain’ (A) denotes the new gain. The normalised gain value is later multiplied with the existing 
information gain to create a new gain that serves as the decision tree’s attribute selection criterion [46]. 
Similar to ID3, the tree’s node is determined by the largest value for the attribute selection criterion. This 
approach generates a new criterion for the ID3 algorithm’s attribute splitting [86]. 


Gain'(A) = (1(s4,S2,--+,Sm) — E(A)) * V(A) (5) 


According to [82], AF overcame the limitations of the classic ID3 algorithm involving multi-value 
bias because the generated decision tree chose age with three attribute values as the root. In contrast, the 
classic ID3 algorithm chose colour-cloth with four attribute values as the root when applied to the customer 
dataset. According to [97], when applied to the mutual fund application dataset, AF created a more plausible 
tree because the DT chose status with two attribute values as the root instead of the standard ID3 approach, 
which chose age with three attribute values as the root. The application of AF reduces the significance of 
attributes with a large number of values. To address the greedy algorithm’s flaws, it enhances the relevance 
of characteristics with fewer values and uses the relation degree values to express the attributes’ importance 
[82]. Sivakumar et al. [55] further demonstrated that when the student dropout dataset was used, the modified 
DT method had greater accuracy than the classic ID3 algorithm, at 97.5% versus 92.5%. Implementing AF 
can help alleviate the overfitting problem and increase the generalisation of the model [55]. Additionally, AF 
produces more productive rules for the decision tree since it generates more optimal DT [97]. 


2.5. Proposed method: fuzzy-ID3 algorithm based on association function (FID3-AF) 

FID3-AF will classify the fuzzified data by utilising a hybrid ID3 algorithm with AF as the classifier 
and configure the required fuzzy set parameters for the fuzzification process using the FUZZYDBD method. 
FID3-AF is a simple, comprehensible and interpretable method. FID3-AF fuzzified the dataset and estimated 
each numerical attribute’s membership degree, including the attribute with integer or continuous values. The 
test data would be applied automatically using the most compatible rules with the input pattern to determine 
the classes. The proposed approach, which fuzzifies the entire dataset, enhances patients’ privacy by 
concealing specific patient data. Medical secrecy is critical for patient records’ security and confidentiality. 
The linguistic variable is beneficial, especially when a coalition of support for the linguistic words spans the 
whole domain, greatly improving accuracy and performance [98]. 

The FID3-AF implementation begins with listwise deletion to handle missing data, followed by the 
construction of membership functions for all continuous features in the datasets using the FUZZYDBD 
method. Each gathered dataset’s numerical attributes with continuous values are defined using triangular 
equal partitioning membership functions and a predefined number of fuzzy sets (all domain attributes have 
the same count of fuzzy sets). Three, five, or seven fuzzy sets are the most often utilised number of fuzzy sets 
in the FUZZYDBD method study [95], [96]. All attributes in the datasets were assigned a value of three 
because this is also the best-estimated number of fuzzy sets in the breast cancer domain, along with the value 
of two, unless a medical expert determined that a different number of fuzzy sets should be used for the 
respective attribute [44], [49]. The dataset’s attribute values are then fuzzified to create data with fuzzy 
values. All integer and continuous attribute data are replaced with linguistic labels from fuzzy sets with the 
maximum degree of compatibility with the input values or fuzzy values to assure the linguistic formation of 
the newly generated dataset. After fuzzifying the dataset, the training data and testing data are separated 
using a 10-fold cross-validation technique. Following that, the training data would be classified using the ID3 
algorithm that had been undergone modification with the association function. FID3-AF’s classification 
method is identical to the standard ID3 algorithm but includes AF, normalised gain, and new gain 
(modification of information gain) as the new computation technique. The enhanced decision tree, FID3-AF, 
employed new gain as an attribute selection criterion. The fuzzified data is utilised to construct an upgraded 
fuzzy decision tree, and the fuzzy decision tree’s generated rules are used to evaluate the method’s 
effectiveness. The most compatible rules were applied to deduce the classes of test data. Finally, the 
classification rates would be ascertain using the testing data classification performance. The flow of FID3-AF 
is seen in Figure 2. 
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Implement list wise deletion for missing data 


Generate equally spacing triangular membership functions 
for continuous attributes following FUZZYDBD 


Fuzzified the respective attributes based on the generated 
membership functions 


Replacement with linguistic variable of the fuzzy sets with 
highest compatibility degree (data in linguistic form) 


Apply the newly generated data into the model as input 
value 
Split the training and testing sets using K-fold cross 
validation 


Modified ID3 algorithm with association function for 
classification (new gain for attribute selection criteria) 


Generate rules and test the classification ability using testing 
set 


Figure 2. Flow of FID3-AF 


3. RESULTS AND DISCUSSION 

This study utilised four breast cancer datasets: the wisconsin breast cancer dataset (original), which 
can be referred to as WBCD (original), the wisconsin diagnostic breast cancer (diagnostic), which can be 
referred to as WDBC (diagnostic), the Coimbra dataset, and the mammographic mass dataset. All datasets 
were collected from the University of California Irvine machine learning repository. The WBCD (original), 
WDBC (diagnostic), and Mammographic Mass datasets are used to differentiate between benign and 
malignant samples, whilst the Coimbra dataset is used to identify healthy controls and patients. Table 1 
contains descriptions of the datasets that were gathered. 

The proposed method’s effectiveness is determined using a 10-fold cross-validation technique, and 
missing data are handled via listwise deletion. The study defined all attributes in the datasets using three 
evenly distributed triangular fuzzy sets for fuzzification. The confusion matrix is utilised to demonstrate the 
proposed method’s classification capability through the computation of the accuracy, precision, recall, and 
Fl-measure. The results from ten independent runs were averaged using the macro-average approach. The 
precision, recall, Fl-measure, and standard deviation of the accuracy (SD) are tabulated in Table 2. 

The proposed method’s accuracy was compared to other existing techniques in order to establish 
and validate the proposed model’s capabilities. Tables 3 to 6 compare the proposed method’s accuracy to 
existing techniques for each of the collected datasets. When the approach does not support the classification, 
a (-) is indicated. The results show that the proposed method, FID3-AF, acquired the highest accuracies 
compared to other existing methods. 
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Table 1. The number of attributes, instances and classes for the datasets 


Dataset Number of attributes Number of instances Number of classes 
WBCD (Original) 10 683 2 
WDBC (Diagnostic) 32 569 2 
Coimbra 10 116 2: 
Mammographic Mass 6 830 2 


Table 2. Experimental results of the collected datasets 


Dataset Precision (%) Recall (%) Fl-Measure (%) SD (decimal) 
WBCD (Original) 94.425 92.992 93.641 0.0046 
WDEC (Diagnostic) 94.194 93.983 94.118 0.0030 
Mammographic Mass 81.423 81.232 81.267 0.0064 
Coimbra 70.040 68.959 69.229 0.0167 


Table 3. Accuracy of the methods using the WBCD (Original) dataset 


Method Accuracy (%) 
Kwok’ SVM Gaussian [99] 91.600 
DT (C4.5) [100] 92.970 
Linear regression [101] 79.840 
ID3 algorithm 92.650 
Proposed method 94.305 


Table 4. Accuracy of the methods using the WDBC (Diagnostic) dataset 


Method Accuracy (%) 
RBF Network [102] 93.670 
Naive Bayes [103] 90.351 
Random Forest-CFS [104] 89.510 
ID3 algorithm - 
Proposed method 94.517 


Table 5. Accuracy of the methods using the Coimbra dataset 


Method Accuracy (%) 
Average K-means + Local Outlier Rectifier V.2.0 56.900 
[105] 
Neural Network [106] 60.000 
Random Forest-ReliefF [107] 68.000 
1D3 algorithm - 
Proposed method 70.862 


Table 6. Accuracy of the methods using the Mammographic Mass dataset 


Method Accuracy (%) 
Random Tree-SVR [108] 80.440 
RBF Network [109] 80.190 
CHAID [110] 78.130 
ID3 algorithm 74.542 
Proposed method 81.325 


The radar charts were created to evaluate and compare the FID3-AF algorithm’s overall classification 
performance with the existing ID3 algorithm. The ID3 algorithm cannot classify the WDBC and Coimbra 
datasets because it cannot handle real-valued data or continuous values with floating points. Thus, performance 
comparisons between the two methods were undertaken solely based on the WBCD dataset, which contains 
integer values ranging from | to 10, and the Mammographic Mass dataset, which has only one continuous 
attribute. The radar chart for the WBCD dataset classification is shown in Figure 3, while the radar chart for the 
Mammographic Mass dataset classification is shown in Figure 4. FID3-AF outperforms the ID3 algorithm in 
terms of accuracy, precision, recall, and Fl-measure for both datasets. 

A t-test was performed to find out if the classification accuracy of the ID3 algorithm and FID3-AF is 
statistically different. The t-value and p-value were determined by analysing the accuracy of ten runs per sample 
with a total degree of freedom of 18. The significance level (a) was adjusted to 0.05, and a two-tailed test was 
used. If the p-value is greater than a, the null hypothesis, HO, is adopted, stating that there is no significant 
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difference between ID3 and FID3-AF. Otherwise, if the p-value is less than a, the alternative hypothesis, H1, 
that asserts a substantial difference exists is adopted. The t-test result for the WBCD dataset classification 
returns a t-value of 7.28533 and a p-value less than 0.00001. Thus, the results between the two methods for 
WBCD dataset classification are significantly different at p less than 0.05. Then, the t-test result for the 
Mammographic Mass dataset classification is 20.27207, with a p-value less than 0.00001. Therefore, the 
Mammographic Mass dataset classification result is also significantly different at p less than 0.05. According to 
this research, all p-values were lower than the level of significance. Thus, it shows that the results of ID3 and 
FID3-AF are statistically and significantly different at a p-value less than 0.05, where the null hypothesis, HO, is 
rejected and the alternative hypothesis, H1, is accepted. 


Radar Chart of WBCD Dataset Classification 


Accuracy 
96 
seeeee » 1D3 ID3-AF 
F1 Measure Precision 
Recall 
Figure 3. Radar chart of WBCD dataset classification 
Radar Chart of Mammographic Mass Dataset 
Classification 
Accuracy 
85 
coeeeee ID3 ID3-AF 
F1-Measure Precision 


Recall 


Figure 4. Radar chart of mammographic mass dataset classification 


FID3-AF achieved an accuracy of 94.305% when tested against the WBCD dataset, 94.517% when 
tested against the WDBC dataset, 70.862% when tested against the Coimbra dataset, and 81.325% when 
tested against the Mammographic Mass dataset. The total results indicated that FID3-AF outperformed and 
was comparable to a large number of previously published methods. The standard ID3 algorithm is incapable 
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of classifying datasets that contain continuous values with floating points, such as the WDBC and Coimbra 
datasets. FID3-AF is more practical than classic [D3 in terms of solving breast cancer classification problems 
since it has demonstrated superior classification results and has the ability to handle continuous data. The 
FUZZYDBD method was widely used for data fuzzification in the literature and had shown exceptional 
performance when evaluated on a breast cancer dataset. In FID3-AF, the FUZZYDBD technique was used to 
configure the fuzzy sets parameters in order to increase the efficacy of data fuzzification, notably in the 
breast cancer field. Additionally, FID3-AF performed a replacement operation with the most compatible 
fuzzy set’s linguistic variable, converting the dataset’s continuous attributes to linguistic forms. Thus, any 
gathered datasets can be classified, including those with continuous values. The use of data fuzzification and 
the linguistic variable replacement process in FID3-AF enabled the decision tree to choose the optimal rules 
for new instances based on logical reasoning. The implementation of AF addressed the shortcomings of the 
ID3 algorithm, which included high variance, poor generalisation ability, and a proclivity for overfitting. 
Furthermore, FID3-AF also proved to be more robust than ID3. In ID3, even discrepancies of 0.01 
continuous values in the data would result in a tree with different routes and classes, whereas FID3-AF 
evaluates the membership degree of each input and has a high tolerance for data uncertainty. In future 
research, the feature selection approach may be utilised to improve classification performance. 


4. CONCLUSION 

FID3-AF is efficient and produces superior classification results for breast cancer data. The 
FUZZYDBD method is simple and effective, allowing for a rapid fuzzification process to occur. The 
proposed FID3-AF method is simple to comprehend, performs well in classification, and alleviates the 
overfitting and generalisation issues associated with the classic ID3. Moreover, FID3-AF overcomes the 
drawback of the classic [D3 algorithm, which is unable to classify data with continuous values. In general, 
the FID3-AF is beneficial and productive for data classification. 
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